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rdgB and OATPRP4 genes and proteins are involved in energy homeostasis 

Description 

This invention relates to the use of nucleic acid sequences encoding rdgB 
or CG3811 homologous proteins, and the polypeptides encoded thereby 
and effectors thereof and to the use thereof in the diagnosis, study, 
prevention, and treatment of diseases and disorders related to body-weight 
regulation, for example, but not limited to, metabolic diseases or 
dysfunctions such as obesity as well as related disorders such as eating 
disorder, cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. 
cancers of the reproductive organs, and sleep apnea. 

There are several metabolic diseases of human and animal metabolism, eg., 
obesity and severe weight loss, that relate to energy imbalance where 
caloric intake versus energy expenditure is imbalanced. Obesity is one of 
the most prevalent metabolic disorders in the world. It is still a poorly 
understood human disease that becomes as a major health problem more 
and more relevant for western society. Obesity is defined as a body weight 
more than 20% in excess of the ideal body weight, frequently resulting in 
a significant impairment of health. It is associated with an increased risk 
for cardiovascular disease, hypertension, diabetes, hyperlipidaemia and an 
increased mortality rate. Besides severe risks of illness, individuals 
suffering from obesity are often isolated socially. 

Obesity is influenced by genetic, metabolic, biochemical, psychological, 
and behavioral factors. As such, it is a complex disorder that must be 
addressed on several fronts to achieve lasting positive clinical outcome. 
Since obesity is not to be considered as a single disorder but as a 
heterogeneous group of conditions with {potential) multiple causes, it is 



also characterized fay elevated fasting plasma insulin and an exaggerated 
insulin response to oral glucose intake (Koltermann, J. Clin, Invest 65^ 
1980, 1272-1284). A clear involvement of obesity in type 2 diabetes 
mellitus can be confirmed (Kopelman, Nature 404, 2000, 635-643). 

The molecular factors regulating food intake and body weight balance are 
incompletely understood. Even if several candidate genes have been 
described which are supposed to influence the homeostatic system{s) that 
regulate body mass/weight, like leptin, VCPI, VCPL or the peroxisome 
proliferator-activated receptor-gamma co-activator, the distinct^molecular 
mechanisms and/or molecules influencing obesity"- or body weight/body 
mass regulations are not known. In addition, several single-gene mutations 
resulting in obesity have been described in mice, implicating genetic factors 
In the etiology of obesity. (Friedman and Leibel, 1990, Cell 69: 217-220). 
In the ob mouse a single gene mutation (obese) results in profound obesity, 
which is accompanied by diabetes (Friedman et. al., 1991, Genomics 1 1 : 
1054-1062 ). 

Therefore, the technical problem underiying the present invention was to 
provide for means and methods for modulating (pathological) metabolic 
conditions influencing body-weight regulation and/or energy homeostatic 
circuits. The solution to said technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the present invention relates to genes with novel functions in 
body-weight regulation, energy homeostasis, metabolism, and obesity. The 
present invention discloses specific genes involved in the regulation of 
body-weight, energy homeostasis, metabolism, and obesity, and thus in 
disorders related thereto such as eating disorder^ cachexia, diabetes 
mellitus, hypertension, coronary heart disease, hypercholesterolemia, 
dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the 
reproductive organs, and sleep apnea. In particular, the present invention 



describes the human rdgB or CG381 1 homologous genes as being involved 
in those conditions mentioned above. 

The term 'GenBank Accession number' relates to NCBI GenBank database 
entries (Benson et al. Nucleic Acids Res. 28, 2000, 15-18). 

The protein tyrosine kinase PYK2 has been implicated in signaling 
pathways activated by G-protein-coupled receptors, intracellular calcium, 
and stress signals. The family of PYK2-binding proteins designated Nirs 
(PYK2 N-terminal domain-interacting receptors) consists of_three Nir 
proteins (NIrl, Nlr2, and Nir3), which bind to the arrrino-terminal domain of 
PYK2 via a conserved sequence motif located in the carboxy terminus. The 
Nir proteins are the human homologues of the Drosophlla retinal 
degeneration B protein (rdgB), a protein implicated in the visual 
transduction pathway in flies. Nirs are calcium-binding proteins that exhibit 
phosphoinositol (PI) transfer activity in vivo- Nir proteins might play a role 
in the control of calcium and phosphoinositide metabolism downstream of 
G-protein-coup!ed receptors (Lev S. et aL, (1999 Mol Cell Biol 1999 
Mar;1 9{3):2278-88). Nir2, a human homolog of Drosophifa melanogaster 
retinal degeneration B protein, is essential for cytokinesis (Litvak V. et aL, 
(2002) Mol Cell Biol 22(14):5064-5075). The phosphatidylinositol-transfer 
protein (PITP; similar to NIR2); catalyzes the transfer of 
phosphatidylinositol (PI) between membranes. PITPs are intimately 
associated with the compartmentalised synthesis of different 
phosphorylated inositol lipids. PI is the primary inositol lipid that is 
synthesised at the endoplasmic reticulum and is further phosphorylated in 
distinct membrane compartments by many specific lipid kinases to 
generate seven phosphorylated inositol lipids which are required for both 
signalling and for membrane traffic. PITPs play essential roles in both 
signalling via phospholipase C and phosphoinositide 3-kinases and in 
multiple aspects of membrane traffic including regulated exocytosis and 



vesicle biogenesis (Cockcroft S., (2001) Semin Cell Dev Biol 
12(2):183-191). 

Cellular uptake of organic solutes is mediated In large part by a gene family 
of transporters called organic anion transporting polypeptides (OATPs) (Cai 
S. Y. et al., (2002) Am J Physiol Gastrointest Liver Physiol 
282(4):G702-710). Organic anion transporting polypeptide 4 (0atp4; 
Slc21a10) is known as a liver-specific and sodium-independent transporter 
that mediates transport of a variety of compounds. The preference of 
Oatp4 for endogenous compounds coupled with its refractory response to 
known drug-metabolizing enzyme inducers suggests that Oatp4 may be 
largely responsible for the homeostasis of endogenous rather than 
exogenous chemicals, including pharmaceuticals (Li N. et aL, (2002) J 
Pharmacol Exp Ther 301 {2):551-560). The multispecific human organic 
anion transporter 4 (hOAT4) mediates the high-affinity transport of 
ochratoxin A (OTA) on the apical side of the proximal tubule, whereas the 
transport characteristics of OTA are distinct from those by basolateral 
OATs (Babu E. et aL (2002) Biochim Biophys Acta 1 590(1 •3):64-75). The 
organic anion transport system is involved in the tubular excretion and 
reabsorption of various drugs and substances. Probenecid could inhibit 
hOAT4-mediated estrone sulfate (ES) uptake in vitro. Comparing the K(i) 
values with the therapeutically relevant concentrations of unbound 
inhibitors in the plasma, probenecid was predicted to inhibit 
hOAT4-mediated organic anion transport in vivo (Enomoto A. et aL, (2002) 
J Pharmacol Exp Ther 301 (3):797-802). 

So far, it has not been described that the rdgB or CG381 1 proteins of the 
invention and homologous proteins are involved in the regulation of energy 
homeostasis and body-weight regulation and related disorders, and thus, 
no functions in metabolic diseases and dysfunctions and other diseases as 
listed above have been discussed. 



In this invention, we demonstrate that the correct gene dose of rdgB or 
CG381 1 is essential for maintenance of energy homeostasis. The fly 
Drosophila melanogaster was used as model organism for the identification 
of proteins involved in the energy homeostasis, Drosophila melanogaster is 
one of the most intensively studied organisms in biology and serves as a 
model system for the investigation of many developmental and cellular 
processes common to higher eukaryotes, including humans (see, for 
example, Adams et a\., Science 287: 2185-2195 (2000)). The success of 
Drosophila melanogaster as a model organism is largely due to the power 
of forward genejtic screens to identify the genes that are involved in a 
biological process (see, Johnston Nat Rev Genet 3: 176-188 (2002); 
Rorth, Proc Natl Acad Sci U S A 93: 12418-12422 (1996)). In this 
invention, we have used a genetic screen to identify, that mutations of 
rdgB or CG3811 homologous genes cause changes in the body weight 
which is reflected by a significant change in the triglyceride levels. 
Triglycerides are the most efficient storage for energy in cells, and are 
significantly increased in obese patients. 

In this invention we refer to retinal degeneration B (rdgB), CG3811, 
rdgB-like, or CG3811-like proteins, membrane associated 
phosphatidylinositol transfer proteins (PITPNM), PYK2 N-terminal 
domain-interacting receptors 3 (NIR3), PYK2 N-termmal domain-interacting 
receptors 1 (NIRD, or organic anion transporter polypeptide-related 
proteins 4 (OATPRP4) which include Drosophila and mammalian, preferably 
human, homologous polypeptides or proteins or sequences encoding those 
proteins. 

The present invention discloses that rdgB or CG381 1 homologous proteins 
are regulating the energy homeostasis and fat metabolism especially the 
metabolism and storage of triglycerides, and polynucleotides, which 
identify and encode the proteins disclosed in this invention. The invention 
also relates to vectors, host cells, antibodies, and recombinant methods for 



producing the polypeptides and polynucleotides of the invention. The 
invention also relates to the use of these polynucleotides, polypeptides and 
effectors thereof, e.g. antibodies, aptamers, anti-sense molecules and other 
receptors capable of interacting with the polypeptides and polynucleotides 
in the diagnosis, study, prevention, and treatment of diseases and 
disorders, for example, but not limited to, metabolic diseases such as 
obesity as well as related disorders such as eating disorder, cachexia, 
diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. 
cancers of the reproductive organs, and sleep apnea. 

-s. 

rdgB or CG3811 homologous proteins and nucleic acid molecules coding 
therefore are obtainable from insect or vertebrate species, e.g. mammals or 
birds. Particularly preferred are nucleic acids encoding the human rdgB or 
CG3811 homologs (in particular the human membrane associated 
phosphatidylinositol transfer protein (PITPNM), human PYK2 N-terminal 
.domain-interacting receptor 3 (NIR3), human PYK2 N-terminal 
domain-interacting receptor 1 (NIR1), or human organic anion transporter 
polypeptide-related protein 4 {OATPRP4)). 

The invention particularly relates to a nucleic acid molecule encoding a 
polypeptide contributing to regulating the energy homeostasis and the 
metabolism of triglycerides, wherein said nucleic acid molecule comprises 

(a) the nucleotide sequence of Drosophila rdgB or CG381 1 , human rdgB 
or CG381 1 homologs, and/or a sequence complementary thereto, 

(b) a nucleotide sequence which hybridizes at 50**C in a solution 
containing 1 x SSC and 0.1 % SDS to a sequence of (a), 

(c) a sequence corresponding to the sequences of (a) or (b) within the 
degeneration of the genetic code, 

(d) a sequence which encodes a polypeptide which is at least 85%, 
preferably at least 90%, more preferably at least 95%, more 
preferably at least 98% and up to 99,6% identical to the amino acid 



sequences of the rdgB or CG381 1 protein, preferably of the human 
rdgB or CG381 1 homologs, 
{e) a sequence which differs from the nucleic acid molecule of (a) to (d) 
by mutation and wherein said mutation causes an alteration, 
deletion, duplication and/or premature stop in the encoded 
polypeptide or 

(f) a partial sequence of any of the nucleotide sequences of (a) to (e) 
having a length of at least 1 5 bases, preferably at least 20 bases, 
more preferably at least 25 bases and most preferably at least 50 
bases. - _ 

The present invention relates to genes with novel functions in body-weight 
regulation, energy homeostasis, metabolism, and obesity, fragments of 
said genes, polypeptides encoded by said genes or fragments thereof, and 
effectors e.g. antibodies, biologically active nucleic acids, such as 
antisense molecules or ribozymes, aptamers, peptides or low-molecular 
weight organic compounds recognizing said polynucleotides or 
polypeptides. 

The invention is based on the finding that rdgB or CG3811 and the 
polynucleotides encoding these, are involved in the regulation of 
triglyceride storage and therefore energy homeostasis. To find genes with 
novel functions in energy homeostasis, metabolism, and obesity, a 
functional genetic screen was performed with the model organism 
Drosophila melanogaster (Meigen). One resource for screening was a 
Drosophila melanogaster stock collection of EP-lines. The P-vector of this 
collection has Gal4-UAS-binding sites fused to a basal promoter that can 
transcribe adjacent genomic Drosophila sequences upon binding of Gal4 to 
UAS-sites {Brand A. H. and Perrimon N., (1993) Development 
118:401-415; Rorth P., (1996) Proc Natl Acad Sci U S A 
93:1 241 8-1 2422). This enables the EP-line collection for overexpression of 
endogenous flanking gene sequences. In addition, without activation of the 



UAS-sites, integration of the EP-element into tlie gene is likely to cause a 
reduction of gene activity, and allows determining its function by 
evaluating the loss-of-function phenotype. 

Triglycerides are the nnost efficient storage for energy in cells, and obese 
patients mainly show a significant increase In the content of triglycerides. 
In order to isolate genes with a function in energy homeostasis, several 
thousand EP-lines were tested for their triglyceride content after a 
prolonged feeding period (see Examples for more detail). Lines with 
significantly changed triglyceride content were selected as positive 
candidates for further analysis. The change of triglyceride content due to 
the loss of a gene function suggests gene activities in energy homeostasis 
in a dose dependent manner that controls the amount of energy stored as 
triglycerides. 

In this invention, the content of triglycerides of a pool of flies with the 
same genotype after feeding for six days was analyzed using a triglyceride 
assay. Male flies homozygous for the integration of vectors for Drosophila 
lines HD-EP{X)1 1 173 and HD-EP{2)26828 were analyzed in an assay 
measuring the triglyceride contents of these flies, illustrated in more detail 
in the EXAMPLES section. The results of the triglyceride content analysis 
are shown in FIGURE 1 and 4, respectively. 

Genomic DNA sequences were isolated that are localized to the EP vector 
{herein HD-EP{X)1 1 173 and HD-EP{2)26828 ) integration. Using those 
isolated genomic sequences public databases like Berkeley Drosophila 
Genome Project {GadFly; see also FlyBase (1999) Nucleic Acids Research 
27:85-88) were screened thereby identifying the integration site of the 
vectors, and the corresponding gene, described in more detail in the 
EXAMPLES section. The molecular organization of the gene is shown in 
FIGURE 2 and 5, respectively. 



The present invention furtlier describes polypeptides comprising tlie amino 
acid sequences of tlie proteins of the invention and homologous proteins. 
Based upon homology, the proteins of the invention and each homologous 
protein or peptide may share at least some activity. 

The invention also encompasses polynucleotides that encode the proteins 
of the invention and homologous proteins. Accordingly, any nucleic acid 
sequence, which encodes the amino acid sequences of the proteins of the 
invention and homologous proteins, can be used to generate recombinant 
molecules that express the -proteins of the invention and hamologous 
proteins. In a particular embodiment, the invention encompasses a nucleic 
acid encoding Drosophila rdgB or CG3811, or human rdgB or CG3811 
homologs; referred to herein as the proteins of the invention. It will be 
appreciated by those skilled in the art that as a result of the degeneracy of 
the genetic code, a multitude of nucleotide sequences encoding the 
proteins, some bearing minimal homology to the nucleotide sequences of 
any known and naturally occurring gene, may be produced. The invention 
contemplates each and every possible variation of nucleotide sequence that 
can be made by selecting combinations based on possible codon choices. 

Also encompassed by the invention are polynucleotide sequences that are 
capable of hybridizing to the claimed nucleotide sequences, - and in 
particular, those of the polynucleotide encoding the proteins of the 
invention, under various conditions of stringency. Hybridization condittons 
are based on the melting temperature (Tm) of the nucleic acid binding 
complex or probe, as taught In Wahl, G. M. and S. L. Berger (1987: 
Methods Enzymol. 152:399-407) and Kimmel, A. R. (1987; Methods 
Enzymol. 152:507-511), and may be used at a defined stringency. 
Preferably, hybridization under stringent conditions means that after 
washing for 1 h with 1 x SSC and 0.1 % SDS at 50°C, preferably at 55^C, 
more preferably at 62°C and most preferably at 68°C, particularly for 1 h 
in 0.2 x SSC and 0.1 % SDS at 50°C, preferably at 55^C, more preferably 
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at 62 and most preferably at 68 a positive hybridization signal is 
observed. Altered nucleic acid sequences encoding the proteins which are 
encompassed by the invention include deletions, insertions or substitutions 
of different nucleotides resulting in a polynucleotide that encodes the same 
or a functionally equivalent protein. 

The encoded proteins may also contain deletions, insertions or 
substitutions of amino acid residues, which produce a silent change and 
result in functionally equivalent proteins. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the^amphlpathic nature of 
the residues as long as the biological activity of the protein is retained. 
Furthermore, the invention relates to peptide fragments of the proteins or 
derivatives thereof such as cyclic peptides, retro-inverso peptides of 
peptide mimetics having a length of at least 4, preferably at least 6 and up 
to 50 amino acids. 

Also included within the scope of the present invention are alleles of the 
genes encoding the proteins of the invention and homologous proteins. As 
used herein, an 'allele' or 'allelic sequence' is an alternative form of the 
gene, which may result from at least one mutation in the nucleic acid 
sequence. Alleles may result in altered mRNAs or polypeptides- whose 
structures or function may or may not be altered. Any given gene may 
have none, one or many allelic forms. Common mutational changes, which 
give rise to alleles, are generally ascribed to natural deletions, additions or 
substitutions of nucleotides. Each of these types of changes may occur 
alone or in combination with the others, one or more times in a given 
sequence. 

The nucleic acid sequences encoding the proteins of the invention and 
homologous proteins may be extended utilizing a partial nucleotide 



sequence and employing various methods known in the art to detect 
upstream sequences such as promoters and regulatory elements. 

In order to express a biologically active protein, the nucleotide sequences 
encoding the proteins or functional equivalents, may be inserted into 
appropriate expression vectors, i.e., a vector which contains the necessary 
elements for the transcription and translation of the inserted coding 
sequence. Methods, which are well known to those skilled in the art, may 
be used to construct expression vectors containing sequences encoding 
the proteins and„the appropriate transcriptional and translational control 
elements. Regulatory elements include for example a-promoter, an initiation 
codon, a stop codon, a mRNA stability regulatory element, and a 
polyadenylation signal. Expression of a polynucleotide can be assured by (i) 
constitutive promoters such as the Cytomegalovirus (CMV) 
promoter/enhancer region, (ii) tissue specific promoters such as the insulin 
promoter (see, Soria et ah, 2000, Diabetes 49:157), S0X2 gene promoter 
(see Li et al., 1998, Curr. Biol. 8:971-4), Msi-1 promoter (see Sakakibara 
etal., 1997, J. Neuroscience 17:8300-8312), alpha-cardfa myosin heavy 
chain promoter or human atrial natriuretic factor promoter (Klug et ah, 
1996, J. din. Invest 98:216-24; Wu et al., 1989, J. Biol. Chem. 
264:6472-79)or (lii) inducible promoters such as the tetracycline inducible 
system. Expression vectors can also contain a selection agent or marker 
gene that confers antibiotic resistance such as the neomycin, hygromycin 
or puromycin resistance genes. These methods include in vitro recombinant 
DNA techniques, synthetic techniques, and in vivo genetic recombination. 
Such techniques are described in Sambrook, J. et aL (1989) Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y. 
and Ausubel, F.M. et al. (1989) Current Protocols in Molecular Biology, 
John Wiley & Sons, New York, N.Y. 

In a further embodiment of the invention, natural, modified or recombinant 
nucleic acid sequences encoding the proteins of the invention and 
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homologous proteins may be ligated to a heterologous sequence to encode 
a fusion protein. 

A variety of expression vector/host systems may be utilized to contain and 
express sequences encoding the proteins or fusion proteins. These Include, 
but are not limited to, micro-organisms such as bacteria transformed with 
recombinant bacteriophage, plasmid or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect ceil systems 
infected with virus expression vectors (e.g., baculovirus, adenovirus, 
adeno-associated virus, lentiverus, retrovirus); plant cell systems 
transformed with virus expression vectors (e^g., catilif lower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors 
(e.g., Ti or PBR322 plasmids); or animal cell systems. 

The presence of polynucleotide sequences of the invention in a sample can 
be detected by DNA-DNA or DNA-RNA hybridization or amplification using 
probes or portions or fragments of said polynucleotides. Nucleic acid 
amplification based assays involve the use of oligonucleotides or oligomers 
based on the sequences specific for the gene to detect transformants 
containing DNA or RNA encoding the corresponding protein. As used 
herein 'oligonucleotides' or 'oligomers' refer to a nucleic acid sequence of 
at least about 10 nucleotides and as many as about 60 nucleotides, 
preferably about 1 5 to 30 nucleotides, and more preferably about 20-25 
nucleotides, which can be used as a probe or amplimer. 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid 
assays. Means for producing labeled hybridization or PGR probes for 
detecting polynucleotide sequences include oligo-labeling, nick translation, 
end-labeling of labeled RNA probes, PGR amplification using a labeled 
nucleotide, or enzymatic synthesis. These procedures may be conducted 
using a variety of commercially available kits {Pharmacia & Upjohn, 
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(Kalamazoo, Mich.); Promega (Madison Wis.); and U.S. Biochemical Corp., 
(Cleveland, Ohio). 

The presence of proteins of the invention in a sample can be determined by 
immunological methods or activity measurement. A variety of protocols for 
detecting and measuring the expression of proteins, using either polyclonal 
or monoclonal antibodies specific for the protein or reagents for 
determining protein activity are known in the art. Examples Include 
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies^ reactive to two 
non-interfering epitopes on the protein is preferred, but a competitive 
binding assay may be employed. These and other assays are described, 
among other places, in Hampton, R. et al. (1990; Serological Methods, a 
Laboratory Manual, APS Press, St Paul, Minn.) and Maddox, D. E. et al. 
(1983; J. Exp. Med. 158:1211-1216). 

Suitable reporter molecules or labels, which may be used, include 
radionuclides, enzymes, fluorescent, chemiluminescent or chromogenic 
agents as well as substrates, co-factors, inhibitors, magnetic particles, and 
the like. 

The nucleic acids encoding the proteins of the invention can be used to 
generate transgenic animal or site specific gene modificatfons in cell lines. 
Transgenic animals may be made through homologous recombination, 
where the normal locus of the genes encoding the proteins of the invention 
is altered. Alternatively, a nucleic acid construct is randomly integrated into 
the genome. Vectors for stable integration include plasmids, retrovirusses 
and other animal virusses, YACs, and the like. The modified cells or animal 
are useful in the study of the function and regulation of the proteins of the 
invention. For example, a series of small deletions and/or substitutions may 
be made in the genes that encode the proteins of the invention to 
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determine the role of particular domains of the protein, functions in 
pancreatic differentiation, etc. 

Specific constructs of interest include anti-sense molecules, which will 
block the expression of the proteins of the invention, or expression of 
dominant negative mutations. A detectable marker, such as for example 
lac-Z, may be introduced in the locus of the genes of the invention, where 
upregulation of expression of the genes of the invention will result in an 
easily detected change in phenotype. 

One may also provide for expression of the genes of the invention or 
variants thereof in cells or tissues where it is not normally expressed or at 
abnormal times of development. In addition, by providing expression of the 
proteins of the invention in cells in which they are not normally produced, 
one can induce changes in cell behavior. 

DNA constructs for homologous recombination will comprise at least 
portions of the genes of the invention with the desired genetic 
modification,, and will include regions of homology to the target locus. DNA 
constructs for random integration need not include regions of homology to 
mediate recombination. Conveniently, markers for positive and/or negative 
selection are included. Methods for generating cells having targeted gene 
modifications through homologous recombination are known in the art. For 
embryonic stem (ES) cells, an ES cell line may be employed, or embryonic 
cells may be obtained freshly from a host, e.g. mouse, rat, guinea pig etc. 
Such cells are grown on an appropriate fibroblast-feeder layer or grown in 
presence of leukemia inhibiting factor (LIF). 

When ES or embryonic cells or somatic pluripotent stem cells have been 
transformed, they may be used to produce transgenic animals. After 
transformation, the cells are plated onto a feeder layer in an appropriate 
medium. Cells containing the construct may be detected by employing a 
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selective medium- After sufficient time for colonies to grow, they are 
picked and analyzed for the occurrence of homologous recombination or 
integration of the construct. Those colonies that are positive may then be 
used for embryo manipulation and blastocyst injection. Blastocysts are 
obtained from 4 to 6 week old superovulated females. The ES cells are 
trypsinlzed, and the modified cells are injected into the blastocoel of the 
blastocyst. After injection, the blastocysts are returned to each uterine 
horn of pseudopregnant females. Females are then allowed to go to term 
and the resulting offspring screened for the construct. By providing for a 
different phenotype of the bFastocyst and the genetically modified cells, 
chimeric progeny can be readily detected. The "^chimeric animals are 
screened for the presence of the modified gene and males and females 
having the modification are mated to produce homozygous progeny. If the 
gene alterations cause lethality at some point in development, tissues or 
organs can be maintained as allogenic or congenic grafts or transplants, or 
in vitro culture. The transgenic animals may be any non-human mammal, 
such as laboratory animal, domestic animals, etc. The transgenic animals 
may be used in functional studies, drug screening, etc. 

Diagnostics and Therapeutics 

The data disclosed in this invention show that the nucleic acids and 
proteins of the invention are useful in diagnostic and therapeutic 
applications implicated, for example but not iimited to, in metabolic 
disorders such as obesity as well as related disorders such as eating 
disorder, cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemla, osteoarthritis, gallstones, cancer, e.g. 
cancers of the reproductive organs, and sleep apnea. Hence, diagnostic 
and therapeutic uses for the proteins of the invention nucleic acids and 
proteins of the invention are, for example but not limited to, the following: 
(i) protein therapeutic, (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic 
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and/or prognostic marker, (v) gene therapy (gene delivery/gene ablation), 
(vi) research tools, and (vii) tissue regeneration in vitro and in vivo 
(regeneration for all these tissues and cell types composing these tissues 
and cell types derived from these tissues). 

The nucleic acids and proteins of the invention and effectors thereof are 
useful in diagnostic and therapeutic applications implicated in various 
applications as described below. For example, but not limited to, cDNAs 
encoding the proteins of the invention and particularly their human 
homologues may be useful -in gene therapy, and the proteins of the 
invention and particularly their human homologues may be useful when 
administered to a subject in need thereof. By way of non-limiting example, 
the compositions of the present invention will have efficacy for treatment 
of patients suffering from, for example, but not limited to, in metabolic 
disorders as described above. 

The nucleic acids of the invention or fragments thereof, may further be 
useful in diagnostic applications, wherein the presence or amount of the 
nucleic acids or the proteins are to be assessed. Further antibodies that 
bind immunospecifically to the novel substances of the invention may be 
used in therapeutic or diagnostic methods. 

For example, in one aspect, antibodies, which are specific for the proteins 
of the invention and Homologous proteins, may be used directly as an 
effector, e.g. an antagonist or indirectly as a targeting or delivery 
mechanism for bringing a pharmaceutical agent to cells or tissue which 
express the protein. The antibodies may be generated using methods that 
are well known in the art. Such antibodies may include, but are not limited 
to, polyclonal, monoclonal, chimeric single chain. Fab fragments, and 
fragments produced by a Fab expression library. Neutralising antibodies, 
(i.e., those which inhibit dimer formation) are especially preferred for 
therapeutic use. 
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For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, humans, and others, may be immunized by injection with the 
protein or any fragment or oligopeptide thereof which has immunogenic 
properties. Depending on the host species, various adjuvants may be used 
to increase immunological response. It is preferred that the peptides, 
fragments or oligopeptides used to induce antibodies to the protein have an 
amino acid sequence consisting of at least five amino acids, and more 
preferably at least 10 amino acids. 

Monoclonal antibodies to the proteins may be prepared using any 
technique that provides for the production of antibody molecules by 
continuous cell lines in culture. These include, but are not limited to, the 
hybrldoma technique, the human B-cell hybridoma technique, and the 
EBV-hybridoma technique {Kohler, G. et aL (1975) Nature 256:495-497; 
Kozbor, D. et aK (1985) J. Immunol. Methods 81:31-42; Cote, R. J. et al. 
Proc. Natl. Acad. Sci. 80:2026-2030; Cole, S. P. et al. (1984) Mol. Cell 
Bfol. 62:109-120). 

In addition, techniques developed for the production of 'chimeric 
antibodies', the splicing of mouse antibody genes to human antibody genes 
to obtain a molecule with appropriate antigen specificity and biological 
activity can be used (Morrison, S. L. et al. (1984) Proc. Natl. Acad. Sci. 
81:6851-6855; Neuberger, M. S. et al (1984) Nature 312:604-608; 
Takeda, S. et al. (1985) Nature 314:452-454). Alternatively, techniques 
described for the production of single chain antibodies may be adapted, 
using methods known in the art, to produce single chain antibodies specific 
for the proteins of the invention and homologous proteins. Antibodies with 
related specificity, but of distinct idiotypic composition, may be generated 
by chain shuffling from random combinatorial immunoglobulin libraries 
(Burton, D. R. (1991) Proc. Nati. Acad. Sci. 88:1 1 120-3). Antibodies may 
also be produced by inducing in vivo production in the lymphocyte 
population or by screening recombinant immunoglobulin libraries or panels 
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of highly specific binding reagents as disclosed in the literature (Orlandl, R. 
et al. (1989) Proc. Natl. Acad. Sci. 86:3833-3837; Winter, G. et at. (1991) 
Nature 349:293-299). 

Antibody fragments which contain specific binding sites for the proteins 
may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by Pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of F{ab')2 fragments. 
Alternatively, Fab expression-libraries may be constructed to aUow rapid 
and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse, W. D. et al. (1989) Science 254:1275-1281). 

Various immunoassays may be used for screening to Identify antibodies 
having the desired specificity. Numerous protocols for competitive binding 
and immunoradiometric assays using either polyclonal or monoclonal 
antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation 
between the protein and its specific antibody. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reacive to 
two non-interfering protein epitopes are preferred, but a competitive 
binding assay may also be employed (Maddox, supra). 

In another embodiment of the invention, the polynucleotides of the 
invention or fragments thereof or nucleic acid effector molecules such as 
aptamers, antisense molecules, ribozymes or RNAi molecules may be used 
for therapeutic purposes. In one aspect, aptamers, i.e. nucleic acid 
molecules capable of binding to a target protein may be obtained by a 
selection procedure involving the use of combinatorial nucleic acid libraries. 



In a further aspect, antisense molecules may be used in situations in which 
it would be desirable to block the transcription of the mRNA. In particular. 
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cells may be transformed with sequences complementary to 
polynucleotides encoding the proteins of the invention and homologous 
proteins. Thus, antisense molecules may be used to modulate protein 
activity or to achieve regulation of gene function. Such technology Is now 
well known in the art, and sense or antisense oligomers or larger 
fragments, can be designed from various locations along the coding or 
control regions of sequences encoding the proteins. Expression vectors 
derived from retroviruses, adenovirus, herpes or vaccinia viruses or from 
various bacterial plasmids may be used for delivery of nucleotide 
sequences to the targeted organ, tissue or cell population. Methods, which 
are well known to those skilled in the art, can i)e used to construct 
recombinant vectors, which will express antisense molecules 
complementary to the polynucleotides of the genes encoding the proteins 
of the Invention and homologous proteins. These techniques are described 
both in Sambrook et at. (supra) and in Ausubel et al. (supra). Genes 
encoding the proteins of the invention and homologous proteins can be 
turned off by transforming a cell or tissue with expression vectors, which 
express high levels of polynucleotides that encode the proteins of the 
invention and homologous proteins or fragments thereof. Such constructs 
may be used to introduce untranslatable sense or antisense sequences into 
a cell. Even in the absence of integration into the DNA, such vectors may 
continue to transcribe RNA molecules until they are disabled by 
endogenous nucleases. Transient expression may last for a month or more 
with a non-replicating vector and even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by 
designing antisense molecules, e.g. DNA, RNA or nucleic acid analogs such 
as PNA, to the control regions of the genes encoding the proteins of the 
invention and homologous proteins, i.e., the promoters, enhancers, and 
introns. Oligonucleotides derived from the transcription initiation site, e.g., 
between positions -10 and +10 from the start site, are preferred. Similariy, 
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inhibition can be achieved using "triple helix" base-pairing methodology. 
Triple helix pairing is useful because it cause inhibition of the ability of the 
double helix to open sufficiently for the binding of polymerases, 
transcription factors or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature {Gee, J. E. et ai, 
(1994) In; Huber, B. E. and B. L Carr, Molecular and Immunologic 
Approaches, Futura Publishing Co,, Mt. Kisco, N.Y.). The antisense 
molecules may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Examples, which may 
be used, include engineered hannmerhead motif ribozyme molecules that 
can be specifically and efficiently catalyze endonucleolytic cleavage of 
sequences encoding the proteins of the invention and homologous 
proteins. Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
cleavage sites which mclude the following sequences: GUA, GUU, and 
GUC. Once identified^ short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structural features which 
may render the oligonucleotide inoperable. The suitability of candidate 
targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Nucleic acid effector molecules of the invention may be prepared by any 
method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as 
solid phase phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by in vitro and in vivo transcription of DNA 
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sequences. Such DNA sequences may be Incorporated Into a variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. 
Alternatively, these cDNA constructs that synthesize antisense RNA 
constitutively or induclbly can be introduced into cell lines, cells or tissues. 
RNA molecules may be modified to increase intracellular stability and 
half-life. Possible modifications include, but are not limited to, the addition 
of flanking sequences at the 5' and/or 3' ends of the molecule or 
modifications in the nucleobase, sugar and/or phosphate moieties, e.g. the 
use of phosphorothioate or 2' O-methyl rather than phosphodiesterase 
linkages within the backbone-of the molecule. This concept Is inherent In 
the production of FN As and can be extended in all -of these molecules by 
the inclusion of non-traditional bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 
of adenine, cytidine, guanine, thymine, and uridine which are not as easily 
recognized by endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and 
equally suitable for use in vivo. In vitro, and ex vivo. For ex vivo therapy, 
vectors may be introduced into stem cells taken from the patient and 
clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection and by liposome injections may be achieved using 
methods, which are well known in the art. Any of the therapeutic methods 
described above may be applied to any suitable subject including, for 
example, mammals such as dogs, cats, cows, horses, rabbits, monkeys, 
and most preferably, humans. 

An additional embodiment of the Invention relates ta the administration of 
a pharmaceutical composition, in conjunction with a pharmaceutically 
acceptable carrier, for any of the therapeutic effects discussed above. 
Such pharmaceutical compositions may consist of the nucleic acids and the 
proteins of the invention and homologous nucleic acids or proteins, 
antibodies to the proteins of the invention and homologous proteins. 
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mimetics, agonists, antagonists or Inhibitors of the proteins of the 
invention and homologous proteins or nucleic acids. The compositions may 
be administered alone or in combination with at least one other agent, such 
as stabilizing compound, which may be administered in any sterile, 
biocompatible pharmaceutical carrier, including, but not limited to, saline, 
buffered saline, dextrose, and water. The compositions may be 
administered to a patient alone or in combination with other agents, drugs 
or hormones. The pharmaceutical compositions utilized in this invention 
may be administered by any number of routes including, but not limited to, 
oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual or rectal mearis. 

In addition to the active ingredients, these pharmaceutical compositions 
may contain suitable pharmaceutically-acceptable carriers comprising 
excipients and auxiliaries, which facilitate processing of the actfve 
compounds into preparations, which can be used pharmaceutically. Further 
details on techniques for formulation and administration may be found in 
the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Easton, Pa.). 

Pharmaceutical compositions suitable for use in the invention .include 
compositions wherein the active ingredients are contained in an effective 
amount to achieve the intended purpose. The determination of an effective 
dose is well within the capability of those skilled in the art. For any 
compounds, the therapeutically effective does can be estimated initially 
either in cell culture assays, e.g., of preadipocyte celt lines or in animal 
models, usually mice, rabbits, dogs or pigs. The animal model may also be 
used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful 
doses and routes for administration in humans. A therapeutically effective 
dose refers to that amount of active ingredient, for example the nucleic 
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acids or the proteins of the invention and homologous proteins or nucleic 
acids or fragments thereof, antibodies of the proteins of the invention and 
homologous proteins, which is sufficient for treating a specific condition. 
Therapeutic efficacy and toxicity may be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., 
ED50 (the dose therapeutically effective in 50% of the population) and 
LD50 (the dose lethal to 50% of the population). The dose ratio between 
therapeutic and toxic effects is the therapeutic index, and it can be 
expressed as the ratio, LD50/ED50. Pharmaceutical compositions, which 
exhibit large therapeutic indices, are preferred. The data obtained from cell 
culture assays and animal studies is used in formulating a range of dosage 
for human use. The dosage contained in such compositions is preferably 
within a range of circulating concentrations that include the ED50 with 
little or no toxicity. The dosage varies within this range depending upon the 
dosage from employed, sensitivity of the patient, and the route of 
administration. The exact dosage will be determined by the practitioner, in 
light of factors related to the subject that requires treatment. Dosage and 
administration are adjusted to provide sufficient levels of the active moiety 
or to maintain the desired effect. Factors, which may be taken into 
account, include the severity of the disease state, general health of the 
subject, age, weight, and gender of the subject, diet, time and frequency 
of administration, drug combination(s), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions 
may be administered every 3 to 4 days, every week or once every two 
weeks depending on half-life and clearance rate of the particular 
formulation. Normal dosage amounts may vary from 0.1 to 100,000 
micrograms, up to a total dose of about t g, depending upon the route of 
administration. Guidance as to particular dosages and methods of delivery 
is provided in the literature and generally available to practitioners in the 
art* Those skilled in the art employ different formulations for nucleotides 
than for proteins or their inhibitors. Similarly, delivery of polynucleotides or 
polypeptides will be specific to particular cells, conditions, locations, etc. 
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In another embodiment, antibodies which specifically bind to the proteins 
may be used for the diagnosis of conditions or diseases characterized by or 
associated with over- or underexpression of the proteins of the invention 
and homologous proteins or in assays to monitor patients being treated 
with the proteins of the invention and homologous proteins, or effectors 
thereof, e.g. agonists, antagonists, or inhibitors- Diagnostic assays include 
methods which utilize the antibody and a label to detect the protein in 
human body fluids or extracts of cells or tissues. The antibodies may be 
used with or without modification, and may be labeled by joining them, 
either covalently or non-covalently, with a reporter molecule. A wide 
variety of reporter molecules which are known ir^the art may be used 
several of which are described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring 
proteins are known in the art and provide a basis for diagnosing altered or 
abnormal levels of gene expression. Normal or standard values for gene 
expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, preferably human, with antibodies to the 
protein under conditions suitable for complex formation. The amount of 
standard complex formation may be quantified by various methods, but 
preferably by photometric, means. Quantities of protein expressed in 
control and disease samples, e.g. from biopsied tissues may be compared 
with the standard values. Deviation between standard and subject values 
establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides specific for 
the proteins of the invention and homologous proteins may be used for 
diagnostic purposes. The polynucleotides, which may be used, include 
oligonucleotide sequences, antisense RNA and DNA molecules, and PNAs. 
The polynucleotides may be used to detect and quantitate gene expression 
in biopsied tissues in which gene expression may be correlated with 
disease. The diagnostic assay may be used to distinguish between 
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absence, presence, and excess gene expression, and to monitor regulation 
of protein levels during therapeutic intervention. 

In one aspect, hybridization with probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding the 
proteins of the invention and homologous proteins or closely related 
molecules, may be used to identify nucleic acid sequences which encode 
the respective protein. The hybridization probes of the subject invention 
may be DNA or RNA and derived from the nucleotide sequence of the 
polynucleotide encoding the proteins of the invention or from a genomic 
sequence including promoter, enhancer element?, and introns of the 
naturally occurring gene. Hybridization probes may be labeled by a variety 
of reporter groups, for example, radionuclides such as ^^P or ^^S or 
enzymatic labels, such as alkaline phosphatase coupled to the probe via 
avidin/biotin coupling systems, and the like. 

Polynucleotide sequences specific for the proteins of the invention and 
homologous nucleic acids may be used for the diagnosis of conditions or 
diseases, which are associated with the expression of the proteins. 
Examples of such conditions or diseases include, but are not limited to, 
pancreatic diseases and disorders, including diabetes. Polynucleotide 
sequences specific for the proteins of the invention and homologous 
proteins may also be used to monitor the progress of patients receiving 
treatment for pancreatic diseases and disorders. Including diabetes. The 
polynucleotide sequences may be used qualitative or quantitative assays, 
e.g. in Southern or Northern analysis, dot blot or other membrane-based 
technologies; in PGR technologies; or In dip stick, pin, ELISA or chip assays 
utilizing fluids or tissues from patient biopsies to detect altered gene 
expression. 

In a particular aspect, the nucleotide sequences specific for the proteins of 
the invention and homologous nucleic acids may be useful in assays that 
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detect activation or induction of various metabolic diseases such as obesity 
as well as related disorders such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, hypercholesterolemia, 
dyslipidemia, osteoarthritis, gallstones, cancers of the reproductive organs, 
and sleep apnea. The nucleotide sequences may be labeled by standard 
methods, and added to a fluid or tissue sample from a patient under 
conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is 
quantitated and compared with a standard value. The presence of altered 
levels of nucleotide sequences encoding the proteins of the invention and 
homologous proteins in a sample indicates the presfence of the associated 
disease. Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regimen in animal studies, in clinical trials 
or in monitoring the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disease associated with 
expression of the proteins of the invention and homologous proteins, a 
normal or standard profile for expression is established. This may be 
accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence or a fragment thereof, 
which is specific for the nucleic acids encoding the proteins of the 
invention and homologous nucleic acids, under conditions suitable for 
hybridization or amplification. Standard hybridization may be quantified by 
comparing the values obtained from normal subjects with those from an 
experiment where a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may 
be compared with values obtained from samples from patients who are 
symptomatic for disease. Deviation between standard and subject values 
is used to establish the presence of disease. Once disease is established 
and a treatment protocol is initiated, hybridization assays may be repeated 
on a regular basis to evaluate whether the level of expression in the patient 
begins to approximate that, which is observed in the normal patient. The 



- 27 - 

results obtained fronn successive assays may be used to show the efficacy 
of treatment over a period ranging from several days to months. 

With respect to metabolic diseases such. as described above the presence 
of an unusual amount of transcript in biopsied tissue from an individual 
may indicate a predisposition for the development of the disease or may 
provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow 
health professionals to employ preventative measures or aggressive 
treatment earlier thereby preventing the development or further progression 
of the metabolic diseases and disorders. 

Additional diagnostic uses for oligonucleotides designed from the 
sequences encoding the proteins of the invention and homologous proteins 
may involve the use of PGR. Such oligomers may be chemically 
synthesized, generated enzymatically or produced from a recombinant 
source. Oligomers will preferably consist of two nucleotide sequences, one 
with sense orientation {5'.fwdarw.3') and another with antisense 
O'.rarw.B'), employed under optimized conditions for Identification of a 
specific gene or condition. The same two oligomers, nested sets of 
oligomers or even a degenerate pool of oligomers may be employed under 
less stringent conditions for detection and/or quantification of. closely 
related DNA or RNA sequences. 

in another embodiment of the invention, the nucleic acid sequences may 
also be used to generate hybridization probes, which are useful for 
mapping the naturally occurring genomic sequence. The sequences may be 
mapped to a particular chromosome or to a specific region of the 
chromosome using well known techniques. Such techniques include FISH, 
FACS or artificial chromosome constructions, such as yeast artificial 
chromosomes, bacterial artificial chromosomes, bacterial PI constructions 
or single chromosome cDNA libraries as reviewed in Price, G. M. (1993) 
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Blood Rev. 7:127-134, and Trask, B. J. (1991) Trends Genet, 7:149-154. 
FISH (as described in Verma et al. (1988) Hunnan Chromosomes: A Manual 
of Basic Techniques, Pergamon Press, New York, N.Y.). The results may 
be correlated with other physical chromosome mapping techniques and 
genetic map data. Examples of genetic map data can be found in the 1 994 
Genome Issue of Science (265:1 981 f). Correlation between the location of 
the gene encoding the proteins of the invention on a physical chromosomal 
map and a specific disease or predisposition to a specific disease, may help 
to delimit the region of DNA associated with that genetic disease. 

The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals. In situ hybridization of chromosomal preparations and physical 
mapping techniques such as linkage analysis using established 
chromosomal markers may be used for extending genetic maps. Often the 
placement of a gene on the chromosome of another mammalian species, 
such as mouse, may reveal associated markers even if the number or arm 
of a particular human chromosome Is not known. New sequences can be 
assigned to chromosomal arms or parts thereof, by physical mapping. This 
provides valuable information to investigators searching for disease genes 
using positional cloning or other gene discovery techniques. Once the 
disease or syndrome has been crudely localized by genetic linkage to a 
particular genomic region, for example, AT to 1 1 q22-23 (Gatti, R. A. et al. 
(1988) Nature 336:577-580), any sequences mapping to that area may 
represent associated or regulatory genes for further investigation. The 
nucleotide sequences of the subject invention may also be used to detect 
differences in the chromosomal location due to translocation, inversion, 
etc. among normal, carrier or affected individuals. 

In another embodiment of the invention, the proteins of the invention, its 
catalytic or immunogenic fragments or oligopeptides thereof, an in vitro 
model, a genetically altered cell or animal, can be used for screening 
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libraries of compounds in any of a variety of drug screening teciiniques. 
One can identify effectors, e.g. receptors, ligands or substrates that bind 
to, modulate or mimic tiie action of one or more of the proteins of the 
invention. The protein or fragment thereof employed in such screening may 
be free in solution, affixed to a solid support, borne on a cell surface, or 
located intracellularly. The formation of binding complexes, between the 
proteins of the invention and the agent tested, may be measured. Of 
particular interest are screening assays for agents that have a low toxicity 
for mammalian cells. The term "agent" as used herein describes any 
molecule, e.g. protein or pharmaceutical, with the capability of altering or 
mimicking the physiological function of one or more of the proteins of the 
invention. Candidate agents encompass numerous chemical classes, 
though typically they are organic molecules, preferably small organic 
compounds having a molecular weight of more than 50 and less than 
about 2,500 Daltons. Candidate agents comprise functional groups 
necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or 
carboxyl group, preferably at least two of the functional chemical groups. 
Thecandfdate agents often comprise carbocyclic or heterocyclic structures 
and/or aromatic or polyaromatic structures substituted with one or more of 
the above functional groups. 

Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, nucleic acids and 
derivatives, structural analogs or combinations thereof. Candidate agents 
are obtained from a wide variety of sources including libraries of synthetic 
or natural compounds. For example, numerous means are available for 
random and directed synthesis of a wide variety of organic compounds and 
biomolecules, including expression of randomized oligonucleotides and 
oligopeptides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural or synthetically produced libraries and 
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compounds are readily modified through conventional chemical, physical 
and biochemical means, and may be used to produce combinatorial 
libraries. Known pharmacological agents may be subjected to directed or 
random chemical modifications, such as acylation, alkylation, esterification, 
amidification, etc, to produce structural analogs. Where the screening 
assay is a binding assay, one or more of the molecules may be joined to a 
label, where the label can directly or indirectly provide a detectable signal. 

Another technique for drug screening, which may be used, provides for 
high throughput screening of compounds having suitable binding affinity to 
the protein of interest as described in publi^ed PCT application 
WO84/03564. In this method, as applied to the proteins of the invention 
large numbers of different small test compounds, e.g. aptamers, peptides, 
low-molecular weight compounds etc., are provided or synthesized on a 
solid substrate, such as plastic pins or some other surface. The test 
compounds are reacted with the proteins or fragments thereof, and 
washed. Bound proteins are then detected by methods well known in the 
art. Purified proteins can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid 
support. In another embodiment, one may use competitive drug screening 
assays in which neutralizing antibodies capable of binding the- protein 
specifically compete with a test compound for binding the protein. In this 
manner, the antibodies can be used to detect the presence of any peptide, 
which shares one or more antigenic determinants with the protein. 

Finally, the invention also relates to a kit comprising at least one of 

(a) a rdgB or CG381 1 nucleic acid molecule or a fragment thereof; 

(b) a vector comprising the nucleic acid of (a); 

(c) a host cell comprising the nucleic acid of (a) or the vector of (b); 

(d) a polypeptide encoded by the nucleic acid of (a); 

(e) a fusion polypeptide encoded by the nucleic acid of (a); 
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(f) an antibody, an aptamer or another receptor against the nucleic acid 
of (a) or the polypeptide of (d) or (e) and 

(g) an anti-sense oligonucleotide of the nucleic acid of (a). 

The kit may be used for diagnostic or therapeutic purposes or for screening 
applications as described above. The kit may further contain user 
instructions. 

The Figures show: 

FIGURE 1 shows the triglyceride content of DrosopWIa retinal degeneration 
B (rdgB, GadFly Accession Number CG1 11 11) mutants. Shown Is the 
change of triglyceride content of HD-EP{X)1 1 173 flies caused by 
integration of the P-vector into the into the annotated transcription unit 
{column 2) in comparison to controls containing all flies of the EP collection 
{'EP-control', column 1). 

FIGURE 2 shows the molecular organization of the mutated retinal 
degeneration B (rdgB, Gadfly Accession Number CG1 1111) gene focus. 

FIGURE 3 shows the comparison (CLUSTAL W 1 .82 multiple sequence 
alignment) of human phosphatidylinositoi transfer protein, human PYK2 
N-terminal domain-interacting receptor 3, human PYK2 N-terminal 
domain-interacting receptor 1, and Drosophila retinal degeneration B 
protein. Gaps in the alignment are represented as 

NIR3_Hs refers to human PYK2 N-terminal domain-interacting receptor 3 
(NIR3, NCBI LocusID: 57605); 

NIR1_Hs refers to human PYK2 N-terminal domain-interacting receptor 1 
(NIR1, NCBI LocusID: 83394); 

PITPNM Hs refers to human membrane-associated phosphatidylinositol 
transfer protein (PITPNM, similar to PYK2 N-terminal domain-interacting 
receptor 2 (NIR2), NCBI LocusID: 9600). 
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FIGURE 4 shows the triglyceride content of Drosophila CG381 1 (GadFly 
Accession Number) mutants. Shown is the change of triglyceride content 
of HD-EP(2)26828 flies caused by integration of the P-vector into the 
annotated transcription unit (column 2) in comparison to controls 
containing all flies of the EP collection ('EP-control', column 1). 

FIGURE 5 shows the molecular organization of the mutated CG3811 
(Gadfly Accession Number) gene locus. 

The examples illustrate the invention: 

Example 1 : Measurement of triglyceride content 

Mutant flies are obtained from a fly mutation stock collection. The flies are 
grown under standard conditions known to those skilled in the art. In the 
course of the experiment, additional feedings with bakers yeast 
(Saccharomyces cerevisiae) are provided for the EP-lines HD-EP(X)ni73 
and HD-EP(2)26828. The average change of triglyceride content of 
Drosophila containing the EP-vector as homozygous viable integration was 
investigated in comparison to control flies (see FIGURES 1 and 4, 
respectively). For determination of triglyceride content, flies were 
incubated for 5 min at 90°C in an aqueous buffer using a waterbath, 
followed by hot extraction. After another 5 min incubation at 90*^C and 
mild centrifugation, the triglyceride content of the flies extract was 
determined using Sigma Triglyceride (INT 336-10 or -20) assay by 
measuring changes in the optical density according to the manufacturer's 
protocol. As a reference the protein content of the same extract was 
measured using BIO-RAD DC Protein Assay according to the 
manufacturer's protocol. These experiments and assays were repeated 
several times. 
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The average triglyceride level of all flies of the EP collection (referred to as 
'EP-controD is shown as 100% In the first colunnns in FIGURES 1 and 4. 
Standard deviations of the measurements are shown as thin bars. 

HD-EP{X)1 1 173 homozygous flies show constantly a lower triglyceride 
content than the controls {column 2 in FIGURE 1, 'HD-EP1 1.1 73'). 
HD-EP(2)26828 homozygous flies show constantly a higher triglyceride 
content than the controls (column 2 in FIGURE 4, 'HD-EP26828'}. 
Therefore, the loss of gene activity is responsible for changes in the 
metabolism of the energy storage triglycerides. 

Example 2: Identification of Drosophila genes associated with energy 
homeostasis 

(i) rdgB 

Genomic DNA sequences were isolated that are localized directly adjacent 
to the EP vector {herein HD-EP{X) 1 1 1 73) integration. Using those isolated 
genomic sequences public databases like Berkeley Drosophila Genome 
Project (GadFly) were screened thereby confirming the hemizygous viable 
integration site of the HD-EP(X) 11173 vector 43 base pairs 5' of expressed 
sequence tag (EST) GH09970, which overlaps with the cDNA of rdgB in 
sense orientation. FIGURE 2 shows the molecular organization of this gene 
locus. The chromosomal localization site of integration of the vector of 
HD-EP{X)1 1 173 is at gene locus X, 12C2 {according to Flybase), 12B8 
(according to Gadfly release 2). In FIGURE 2, the genomic, DNA sequence 
is represented by the assembly as a black arrow in the lower half of the 
figure that Includes the integration site of HD-EP{X)1 1 1 73. Ticks represent 
the length in basepairs of the genomic DNA (10000 base pairs per tick). 
The insertion site of the P-element in Drosophila HD-EP(X)1 1 173 line is 
shown as triangle and is labeled. Dark grey bars, linked by dark grey lines 
represent cDNAs of the predicted gene (as predicted by the Berkeley 
Drosophila Genome Project, GadFly release 3). Predicted exons of the 
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Drosophila cDNA rdgB (GadFIy Accession Number CGI 1 1 1 1 ) are shown as 
dark grey bars and predicted introns as slim grey lines in the middle of the 
figure. Therefore, expression of the cDNA encoding rdgB could be affected 
by integration of vectors of line HD-EP(X) 1 1 1 73, leading to a change in the 
amount of energy storage triglycerides, 

(ii) CG3811 

Genomic DNA sequences were isolated that are localized directly adjacent 
to the EP vector (herein HD-EP{2)26828) integration. Using those isolated 
genomic sequences public databases like Berkeley Drosophila Genome 
Project (GadFIy) were screened thereby confirming the homozygous viable 
integration site of the HD-EP(2)26828 into the first intron of CG381 l-RA 
and CG381 1-RB in sense orientation 5' of the translation start. FIGURE 5 
shows the molecular organization of this gene locus. The chromosomal 
localization site of integration of the vector HD-EP(2)26828 is at gene 
locus 2U 30B5 (according to FlyBase), 30B10 (according to GadFIy release 
3). In FIGURE 5, the genomic DNA sequence is represented by the 
assembly as a black arrow In the lower half of the figure that includes the 
integration site of HD-EP(2)26828, Ticks represent the length in basepairs 
of the genomic DNA (10000 base pairs per tick}. The insertion site of the 
P-element in Drosophila HD-EP(2)26828 line is shown as triangle and is 
labeled. Dark grey bars, linked by dark grey lines represent cDNAs of the 
predicted genes (as predicted by the Berkeley Drosophila Genome Project, 
GadFIy release 3). Predicted exons of the Drosophila cDNA CG381 1 
(GadFIy Accession Number) are shown as dark grey bars and predicted 
introns as slim grey lines in the middle of the figure. Therefore, expression 
of the cDNA encoding CG381 1 could be affected by integration of vector 
of line HD-EP{2)26828, leading to a change in the amount of energy 
storage triglycerides. 
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Table 1 is summarizing the data of our molecular analysis of the Drosophila 
proteins identified in this invention as being involved in the regulation of 
the metabolism. 

Table 1. Molecular analysis of Drosophila rdgB and CG381 1 



Analysis 


Genetic interaction 


rdgB 


Arrestin 2 (metarhodopsin binding), Su(rdgB)116, Su(rdgB)83, 
neither inactivation nor afterpotential E (blue-sensitive opsin, G- 
protein coupled receptor), no receptor potential A (1- 
phosphatidylinositol-4, 5-bisphosphate phosphodiesterase), 
su(rdgB)69, su(rdgB)82, transient receptor potential (calcium 
channel, light-activated voltage-gated calcium channel), 
inactivation no afterpotential C (diacylglycerol- 
activated/ phospholipid dependent protein kinase C, protein 
serine/threonine kinase) 


CG3811 


not described (Flybase) 
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AnalysiX 


Protein 


rdgB 


calcium-dependent phosphatidylinositol transporter involved in 
phototransduction which is a component of the subrhabdomeral 
cisterna (Flyhase) 


CG3811 _ 


organic anion transporter (Flybase) 


Analysis 


Protein domains 


rdgB 


Phosphatidylinositol transfer protein, Transthyretin (prealbumin) 

(Flybase) 


CG3811 


Kazal-type serine protease inhibitor family, Ovomucoid/PCM 
like inhibitors (Flybase) 


Analysis 


InterPro analysis - 


rdgB 


Phosphatidylinositol transfer protein (IPR001666), DDHD domain 
nPR004177^ 


CG3811 


Organic anion transporter polypeptide (OATP), N-terminal 
(IPR004156), Organic anion transporter polypeptide (OATP), C- 
terminal nPR0041«57^ 


Analysis 


Locus 


rdgB 


X. 12C2 (Flvbase). X 12B8 (Gadflv release 2^ 


CG3811 


2L. 30B5 (RvbaseV. 2L. 30B10 (Gadfly release 3^ 


Analysis 


Ests 


rdgB 


RE45373 GH09984, GH02230, GH09970, GH25928, GH05951, 
AT06956, AT02250, AT31618, ATI 461 5, HL04242, GH06915, 
_.GH06815. HL05647. HL04901. HL05761 (Gadfly release 3) 


CG3811 


RH67848, RE26384, RE26532, RE67085, RE71033, RE05190, 
RE69990, RE64714, RE40836, RE67711, RE74669, LD12848, 
LD44888, AT16857, LD14805, BE977664, AT03431, AT02757, 
AT04082, AT16279, AT17989, AT04057, AT01127, RH66792, 
AT01570. GH04717. AT23359, GH0913S, GH27!53'i, AT24045 


Analysis 


CDNA 


rdgB 


AA698303 (745 base pairs mRNA), AI1 13602 (711 base pairs 
mRNA), AW941440 (6/9 base pairs mRNA), AW941749 (489 base 
pairs mRNA), AY051422 (4188 base pairs mRNA; 
protein :AAK92846), X57978 (4308 base pairs mRNA; 
protein :CAA41 044), Y08035 (4310 base pairs mRNA; 
Drotein:CAA69291 UFlvbase^ 


CG3811 


AI064338 (577 base pairs mRNA), AI401965 (539 base pairs mRNA) 
(Rvbase) 
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Analysis 


genomic DNA 


rdgB 


AE003493 (308092 base pairs DNA; protein :AAF483 15, 1250 amino 
ariH-?; protein! AAF4831 6. 1259 amino acids) fFlvbase) 


CG3811 


AFnO^fi?"! (7Anft74 base pairs DNA; protein :AAF52781^ ^Flvbase^ 


Analysis 


NrmioriJSlD ..... _ 


rdgB 


32340, Dm rdgB, retinal degeneration B, 12B8-12B9 (Aliases: x36, 
otal, DrdgB, hypoF, CG11111, CT31083, CT31182, DrdgBalpha; 
EefSeq: NM 078594; Nucleotide: AE003493, Z32300, AW941440, 
AW941749, AY051422, X57978, Y08035; Protein: NP_511149 (1250 
amino acids), AAF48315 (1250 amino acids), AAF48316 (1259 
amino acids), AAK92846 (1259 amino acids), CAA41044 (1054 
amino acids), CAA69291 (1250 amino acids) 


CG3811 


34268, Dm CG3811, 30B10 (Aliases: CT12753; RefSeq: 
NM_1 35449; Nucleotide: AE003625, AI064338, AI401965; Bcotein: 
NP 609293 (1163 amino acids), AAF52781 ni9T amino acids) 


Analysis 


Drosophila mutations a mutants 


rdgB 


There are 38 recorded alleles, 5 in vitro constructs, 32 classical 
mutants, and 1 wild-type (Rybase) 


CG3811 


not described (Rybase) 


Analysis 


Phenotypic info 


rdgB 


Mutations have been isolated v/hich affect the photoreceptor 
cell R7 to R8, the retina, the lamina and 2 other listed tissues 
and are neurophysiology defective and visual behaviour 
defective. Among findings on rdgB mutants, retarding the light 
induced photoreceptor degeneration in rdgB mutants by calcium 
channel blockers suggests that toxic increase in intracellular 
calcium by means of voltage gated calcium channel leads to 
photoreceptor degeneration in mutants. ^Flybase) 


CG3811 


not described ^Flybase) 



Example 3: Identification of human rdgB and CG3811 genes and proteins 



rdgB or CGSSII homologous proteins and nucleic acid molecules coding 
therefor are obtainable from insect or vertebrate species, e.g. mammals or 
birds. Particularly preferred are nucleic acids comprising Drosophila rdgB, 
CG3811, or human rdgB or CGSSII homologs {in particular the human 
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membrane associated phosphatidylinositol transfer protein (PITPNM), 
human PYK2 N-terminal domain-interacting receptor 3 (NIR3), human PYK2 
N-terminal domain-interacting receptor 1 (NIRI), or human organic anion 
transporter polypeptide-related protein 4 (OATPRP4)). 
Sequences homologous to Drosophila rdgB or CG381 1 were identified 
using the publicly available program BLASTP 2.2.3 of the non-redundant 
protein data base of the National Center for Biotechnology Information 
{NCBIXsee, Altschul et al., 1997, Nucleic Acids Res. 25:3389-3402). 
Table 2 shows the best human homologs of the Drosophila rdgB or 
CG3811. 

Table 2. Human homolog proteins to Drosophila rdgB or CG381 1 proteins 
L rdgB 

NCBI {National Center for Biotechnology Information) human locus 
identification (ID): 9600, Hs PITPNM, phosphatidylrnositol transfer protein, 
membrane-associated, 1 1 q1 3 
Aliases: NIR2, DRES9 

RefSeq[R]: GenBank Accession Number NM_004910 

Nucleotide: GenBank Accession Numbers AF334584, BC022230, X98654 
Protein: GenBank Accession Numbers NP 004901 {1244 amino acids), 
AAK01444 (1244 amino acids), AAH22230 {1243 amino acids), 
CAA67224 (1244 amino acids) 

* * Patents: GenBank Accession Number NP_004901 shows 99% identity 
to AAW59358 Human retinal degeneration B2 polypeptide, disclosed in 
W09816639-A1{SUGEN INC.), 

* NCBI (National Center for Biotechnology Information) human locus 
identification (ID): 57605, Hs NIR3, PYK2 N-terminal domain-interacting 
receptor 3, 12q24.31 

Aliases: Rdgb2, KIAA1457 

RefSeq[R]: GenBank Accession Number XM_038799 
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Nucleotide: GenBank Accession Number AB040890, AF334585, 
AL133612 

Protein: XP_038799 (1349 amino acids), BAA95981 (989 amino acids), 
AAK01445 (1349 amino acids), CAB63741 (877 amino acids)* ***** 
Patents: GenBanIc Accession Number XP_038799 shows 100% identity to 
AAW59359 Human retinal degeneration B3 polypeptide (hrdgBS), disclosed 
in W09816639-A1 (SUGEN INC.), 

NCBI (National Center for Biotechnology Information) human locus 
identification (ID): 83394, Hs NIR1, PYK2 N-terminal domain-interacting 
receptor 1, 17p13 

RefSeq[R]: GenBank Accession Number NM_031220 

Nucleotide: GenBank Accession Numbers AF334586, AL389994 

Protein: GenBank Accession Numbers NP_1 12497 (974 amino acids), 

AAK01446 (974 amino acids), CAB97544 (912 amino acids) 

Patents: GenBank Accession Number NP_1 12497 shows 

100% identity to AAW59357 Human retinal degeneration B1 polypeptide 

(hrdgBI), disclosed in W09816639-A1 (SUGEN INC.) 

II. CG3811 

NCBI (National Center for Biotechnology Information) human locus 

identification (ID): 81796, Hs 0ATPRP4, organic anion transporter 

polypeptide-related protein 4, 8q13.1 

RefSeq: GenBank Accession Number Ni\/l_030958 

Nucleotide: GenBank Accession Numbers AF205075, AK022760 

Protein: GenBank Accession Numbers NP_1 12220, AAG42207 (both 848 

amino acids) 

Patents: GenBank Accession Number NP_1 12220 shows 99% identity to 
AAB49147 Human organic anion transport protein RP4 protein, disclosed 
in WO200071566-A2 (BRISTOL-MYERS SQUIBB CO.) 

The mouse homologous cDNAs encoding the polypeptides of the invention 
were identified as GenBank Accession Numbers NM_008851 (for the 
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mouse homolog to PITPNM), NM_011256 (for the mouse homolog to 
NIR3.), XM_1 26447 (for the mouse homolog to NIR1) for the mouse 
homologs to rdgB, and GenBank Accession Number XM_1 36452 for the 
mouse homolog to CG381 1 . 



■ . H'. fill 

Claims '*0«-aB^ 

1 . A pharmaceutical composition comprising a nucleic acid molecule of 
5 the PYK2 N-termlnal domain-interacting receptor (rdgB) gene family, 

or the organic anion transporter polypeptide-related protein (CG 
381 1) gene family or a polypeptide encoded thereby or a fragment 
or a variant of said nucleic acid molecule or said polypeptide or an 
effector of said nucleic acid molecule or polypeptide together with 
10 pharmaceutically acceptable carriers, diluents and/or adjuvants. 

2. The composition of claim 1 , wherein the nucleic acid molecule is a 
vertebrate or insect rdgB or CG3811 nucleic acid, partrculary 
encoding the human rdgB or CG3811 homologs, and/or a nucleic 

^5 molecule which is complementary thereto or a fragment thereof or a 

variant thereof. 



The composition of claim 1 or 2, wherein said nucleic acid molecule 

(a) hybridizes at 50°C m a solution containing 1 x SSC and 0.1 % 
SDS to a nucleic acid molecule as defined in claim 2 and/or a 
nucfeic acid molecule which is complementary thereto; 

(b) it IS degenerate with respect to the nucleic acid molecule of 
(a), 

(c) encodes a polypeptide which is at least 85%, preferably at 
least 90%, more preferably at least 95%, more preferably at 
least 98% and up to 99,6% identical to the human rdgB or 
CG381 1, as defined in claim 2; 

(d) differs from the nucleic acid molecule of (a) to (c) by mutation 
and wherein said mutation causes an alteration, deletion, 
duplication or premature stop in the encoded polypeptide. 



- 42 - 

The composition of any one of claims 1-3, wherein the nucleic acid 
molecule is a DNA molecule, particularly a cDNA or a genomic DNA. 

The composition of any one of claims 1-4, wherein said nucleic acid 
encodes a polypeptide contributing to regulating the energy 
homeostasis and/or the metabolism of triglycerides. 

The composition of any one of claims 1-5, wherein said nucleic acid 
molecule is a recombinant nucleic acid molecule. 

The composition of any one of claims 1-6, wherein the nucleic acid 
molecule is a vector, particularly an expression vector. 

The composition of any one of claims 1-5, wherein the polypeptide 
is a recombinant polypeptide. 

The composition of claim 8, wherein said recombinant polypeptide is 
a fusion polypeptide. 

20 10. The composition of any one of claims 1-7, wherein said nucleic acid 
molecule is selected from hybridization probes, primers and 
anti-sense oligonucleotides. 

11. The composition of any one of claims 1-10 which is a diagnostic 
25 composition. 

12. The composition of any one of claims 1-10 which is a therapeutic 
composition. 



5. 

5 

6. 

10 

7. 
8. 

15 

9. 
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13. The composition of any one of claims 1-12 for the manufacture of 
an agent for detecting and/or verifying, for the treatment, alleviation 
and/or prevention of an disorders, including metabolic diseases such 
as obesity and other body-weight regulation disorders as well as 
related disorders such as eating disorder, cachexia, diabetes 
mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemla, osteoarthritis, gallstones, 
cancer, e.g. cancers of the reproductive organs, and sleep apnea 
and others, in cells, cell masses, organs and/or subjects. 



14. Use of a nucleic acid molecule of the PYK2 N-terminal 
domain-interacting receptor (rdgB) gene family, or the organic anion 
transporter polypeptide-related protein (CG3811) gene family or a 
polypeptide encoded thereby or a fragment or a variant of said 
15 nucleic acid molecule or said polypeptide or an effector of said 

nucleic or polypeptide for controlling the function of a gene and/or a 
gene product which is influenced and/or modified by a rdgB or 
CG3811 homologous polypeptide. 

20 15. Use of the nucleic acid molecule of the PYK2 N-terminal 
domain-interacting receptor (rdgB) gene family, or the organic anion 
transporter polypeptide-related protein (CG3811) gene family or a 
polypeptide encoded thereby or a fragment or a variant of said 
nucleic acid molecule or said polypeptide or an effector of said 

25 nucleic acid molecule or polypeptide for identifying substances 

capable of interacting with a rdgB or CG381 1 homologous 
polypeptide. 



30 



16. 



A non-human transgenic animal exhibiting a modified expression of 
a rdgB or CG381 1 homologous polypeptide. 
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17. The animal of claim 16, wherein the expression of the rdgB or 
CG3811 homologous polypeptide is increased and/or reduced. 

18. A recombinant host cell exhibiting a modified expression of a rdgB 
5 or CG381 1 homologous polypeptide. 

19. The cell of claim 18 which is a human cell. 

20. A method of identifying a (poly)peptide involved in the regulation of 
10 energy homeostasis and/or metabolism of triglycerides in a mammal 

comprising the steps of 

(a) contacting a collection of {poly)peptides with a rdgB or 
CG381 1 homologous polypeptide or a fragment thereof under 
conditions that allow binding of said (poly)peptides; 
15 (b) removing (poly)peptides which do not bind and 

(c) identifying {poly)peptides that bind to said rdgB or CG381 1 
homologous polypeptide. 

21 . A method of screening for an agent which modulates the interaction 
20 of a rdgB or CG3811 homologous polypeptide with a binding 

target/agent, comprising the steps of 
(a) mcubating a mixture comprising 

(aa) a rdgB or CG381 1 homologous polypeptide or a fragment 
thereof; 

25 (ab) a binding target/agent of said rdgB or CG381 1 homologous 

polypeptide or fragment thereof; and 

(ac) a candidate agent 

under conditions whereby said rdgB or CG3811 polypeptide or 
fragment thereof specifically binds to said binding target/agent at a 
30 reference affinity; 
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(b) detecting the binding affinity of said rdgB or CG3811 
polypeptide or fragment thereof to said binding target to 
determine an (candidate) agent-biased affinity; and 

(c) determining a difference between (candidate) agent-biased 
5 affinity and the reference affinity. 

22. A method of screening for an agent which modulates the activity of 
a rdgB or CG381 1 homologous polypeptide comprising the steps of: 
(a) incubating a mixture comprising 
10 (aa) a rdgB or CG3811 homologous polypeptide or a 

fragment thereof; 
(ab) a candidate agent; 
under condition wherein said rdgB or CG3811 polypeptide or 
fragment thereof exhibits a reference activity; 
15 (b) detecting the activity of said rdgB or CG381 1 polypeptide or 

fragment thereof in the presence of said candidate agent and 
(c) determining a difference between activity in the presence of 
said candidate agent and a reference activity. 



23. A method of producing a composition comprising the {poly)peptide 
identified by the method of claim 20 or the agent Identified by the 
method of claim 2 lor 22 with a pharmaceutically acceptable carrier, 
diluent and/or adjuvant. 

24. The method of claim 23 wherein said composition is a 
pharmaceutical composition for preventing, alleviating or treating of 
diseases and disorders, including metabolic diseases such as obesity 
and other body-weight regulation disorders as well as related 
disorders such as eating disorder, cachexia, diabetes mellitus, 
hypertension, coronary heart disease, hypercholesterolemia, 
dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the 
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reproductive organs, and sleep apnea and other diseases and 
disorders. 

Use of a (poly)peptide as identified by tJie method of claim 20 or of 
an agent as identified by the method of claim 21 or 22 for the 
preparation of a pharmaceutical composition for the treatment, 
alleviation and/or prevention of of diseases and disorders, including 
metabolic diseases such as obesity and other body-weight regulation 
disorders as well as related disorders such as eating disorder, 
cachexia, diabetes mellitus, hypertension, coronary heart disease, 
hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, 
cancer, e.g. cancers of the reproductive organs, and sleep apnea 
and other diseases and disorders. 

Use of a nucleic acid molecule of the PYK2 N-terminal 
domain-rnteracting receptor (rdgB) gene family, or the organic anion 
transporter polypeptide-related protein (CG381 1) gene family or of a 
fragment thereof for the preparation of a non-human animal which 
over- or under-expresses the rdgB or CG381 1 gene product. 

Kit comprising at least one of 

(a) a rdgB or CG3811 nucleic acid molecule or a fragment 
thereof; 

(b) a vector cbmprising the nucleic acid of (a); 

(c) a host cell comprising the nucleic acid of (a) or the vector of 
(b); 

(d) a polypeptide encoded by the nucleic acid of (a); 

(e) a fusion polypeptide encoded by the nucleic acid of (a); 

(f) an antibody, an aptamer or another receptor against the 
nucleic acid of (a) or the polypeptide of (d) or (e) and 

(g) an anti-sense oligonucleotide of the nucleic acid of (a). 
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Abstract 

The present invention discloses rdgB or CG3811 liomologous proteins 
regulating the energy homeostasis and the metabolism of triglycerides, and 
polynucleotides, which identify and encode the proteins disclosed in this 
invention. The invention also relates to the use of these sequences in the 
diagnosis, study, prevention, and treatment of diseases and disorders, for 
example, but not limited to, metabolic disorders and diseases such as the 
metabolic syndrome, including obesity, eating disorder, cachexia, diabetes 
melfitus, hypertension, coronary heart disease, Siypercholesterolemia, 
dysfipidemia, osteoarthritis, gallstones, cancers of the reproductive organs, 
and sleep apnea. 
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FIGURE 1. Triglyceride content of a Drosophila retinal degeneration B (rdgB, 
GadFly Accession Number CG11111) mutant 



140 




EP-control HD-EP11173 



II II II 

II II II 

II II II 

II nil 



CO 



* 8 



8 



3/? 



FIGURE 3. CLUSTAL W (1.82) multiple sequence alignment using rdgB protein 
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MIIKEYRIPLPMTVEEyRIAQIiyMIQKKSRNETYGEGSGVEILENRPYTDGPGGSGQYTH 

^Q:lIKEyHII,LPMSLDEYQVAQLyMIQKKSREESSGEGSGVEIr,ANRPYTDGP^ 
MLIKEYRIPLPLTVEEYRIAQLYMIAKKSREESHGEGSGVEIIINEPYKDGP6GNGQYTK 

KVYHVGMHIPSWFRSILPKAALRWEESWNAYPYTRTRFTCPFVEKFSIDIETFYKTDAG 

KWHVGSHIPGWFRALLPKAALQVEEESWNAYPYTRTRYTCPFVEKFSIEIETYYL^ 
KIYHVGNHLPGWIKSIiLPKSAIiTVEEEAWNAYPYTRTRYTCPFVEKFSLDIETYYYPDNG 

ENPDVFNLSPVEKNQIiTIDFIDIVKDPVPHNEYKTEEDPKLFQSTKTQRGPLSENWIEEY 

QQPim^NLSGAERRQRILDTIDIVRDAVAPGEYKAEEDPRLYHSVKTGRGPLSDDWAR 
YQDm^QLSGSDLRNRIVDVIDIVKDQLWGGDYVKEEDPKHFVSDKTGRGPIAEDWLEEY 



KKQVF PIMCAYKLCKVEFRYWGMQSKIERFIHDTGLRRVMVRAHRQAWCW 

AAQTG PMCAYKIiCKVEFRYWGMQAKIEQFIHDVGLRRVMLRAHRQAWCW 

WREVKGKKQPTPRNMSLMTAYKICRVEFRYWGMQTKLEKFIHDVALRKMMLRAHRQAWAW 

QDEWYGLSMENIRELEKEAQLMIiSRKMAQFNEDGEEATEIi VKHEAVSDQ 

~ ^MAKAGR 

QDEWTELSMADIRALEEETARMLAQRMAKCWTGSEGSEAQPPGKPSTEARSAASN 

QDEWFGLTIEDIRELERQTQLAIAKKMGGGEECSDDSVSEPYVSTAATAASTTGSERKKS 



■►TSGEPPEPSSSN-— GEPLVGRGLKKQWSTSSKSSRSSKRGASPSRH- 



-AGGPPP- 



-GGGAP 



TGTPDGPEAPPG — PDASPDASFGKQWSSSSRSSYSSQHGGAVSPQ- 

APAVPPXVTQQPPSAEASSDEEGEEEEDDDEDBNDAXGTGVDLSANQGGSAQRSRSQSXQ 



SISEWRMQSXARDSDESSDDEFFDAHEDLSDTEEM 

WHLRNVIiSDSVESSDDEFFDAREEMAEGKNA 

Sr.SEWRMQNXARDSENSSEEEFFDAHEGFSDSEEV 

MAQKGKFGSKGALHSPVGSAHSFDLQVANWRMERLEVDSKSNSDEEFFDCLDTNETNSLA 

FPKDITKWSSNDLMDKIBSPEPEDTQDG r.yRQGAPEPRVASSV~EQLN- 

XLIGMSQWNSNDI.VEQXETMGKI.DEHQGEGTAPCTSSILQEKQRELYRVSLRR QRFPA 

FPKEMTKWNSNDFXDAFASP VEAEG TPEPGAEAAKGIEDG—AQAPR 

KWSSLELLGEGDDSPPPHGGPSSAASVGGRGNSRQEDSXFNQDPLMRVASERGNKRQLRS 

NXR3_Hs XIEDEVS— QPLAAPPSKIHVLLLVLHGGTILDTGAGDPSSKK6DANTXANVFDTV 

NIR1_HS QGSIEIHEDSE—EGCPQRSCKTHVLLLVLHGGNILDTGAGDPSCKAADIHTFSSVLEKV 

PITPNM_HS D SEGIiDGA—GELGAEACAVHALFLXLHSGNXLDSGPGDANSKQADVQTLSSAFEAV 

CGI 1 1 1 1-PB SASVDRSHDSSPPGSPSTPSCPTTrLXLWHAGSVLDAAS-ELTAKKSDVTTFRGSPBAV 

NIR3_HS MRVHYPSALGRLAIRLVPCPPVCS0APALVSNLSPYSHDEGCLSSSQD HIPLAALPL 

NIR1_HS TRAHFPAAX^GHILIKFVPCPAICSEAFSLVSHLNPYSHDEGCLSSSQD HVPLAALPL 

PITPNM_HS TRIHFPEALGHVALRLVPCPPICAAAYAI,VSNI.SPYSHDGDSLSRSQD HXPLAALPL 

CGI 1 1 1 1-PB MRQHYPSIiLTHVTXKMVPCPSICTDALGXLSSIiSPYSFDASPSAADXPNXADVPXGAXPI, 

NIR3_HS 3QATSSPQYQEAVATVIQRANLAYGDFIKSQEGMTFNGQVCLXGDCVGGILAFDALCYSNQ 

NIR1_HS LAXSSPQYQDAVATVIERANQVYREFLKSSDGIGFSGQVCLIGDCVGGLLAFDAICYSAG 

PITPNM_HS LA'TSSSRYQGAVATVIARTNQAYSAFLRSPEGAGFCGQVALIGDGVGGXIiGFDALCHSAN 

CGI 1 1 1 1-PB liSVASPEFHETVNKTVAAANlVYHEFLKSEEGHGFSGQXVMLGDSMGSLLAYEALCRSNG 



^VSES-QSSSRRGSWSMQDNDI,X.SPGILMNAAHCCGGGGGGGGGGGSSGGGGSSGGSqT 
NIR1_HS PSGDSPASSSRKGSISSTQDTPVA oov^^bbi. 

PITPNM_Hs AGTGS-RGSSRRGSMNNELLSPEFG 

CGI 1 1 1 1-PB SQPGTASGASNSGGDAATN — 
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PVRDPLA 
X 
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PITPNM_Hs 
CGlllll-PB 

NIR3_Hs 
NIR1_HS 
PITPNM_HS 
CGlllll-PB 

NIR3_Hs 
NIR1_HS 
PITPNM_Hs 
CGlllll-PB 

NIR3_Hs 
NIR1_HS 
PITPNM_HS 
CGlllll-PB 

NIR3_Hs 
NIR1_HS 
PITPNM_Hs 
CGlllll-PB 

NIR3_Hs 
NIRl_Hs 
PITPNM_Hs 
CGlllll-^PB 

NIR3_Hs 
NIRl_Hs 
PITPNM_Hs 
CGlllll-PB 

NIR3_Hs 
NIRl_Hs 
PITPNM_Hs 
CGlllll-PB 

NIR3_Hs 
NIRl_Hs 
PITPNM_Hs 
CGlllll-PB 

NIR3_HS 
NIRl_Hs 
PITPNM_HS 
CGlllll-PB 

NrR3 HS 



ESSRHLSRSNVDIPRSNGTEDPKRQLPRKRSDSSTYELDTXQQHQAFLSSLHASVLRTEP 
ASSKRLSKSNIDISS6LEDEEPKRPLPRKQSDSSTYDCEAITQHHAFLSSIHSSVLKDES 

DGVEGLGRGSPEPS ALPPQRIPSDMASPEPEGSQNSLQAAPATTSSWEPRRASTA 

NTHNPLSPRN SRLDDDERFIEADLDAKRLLVAPSPRRRRSSS 

CSRHSSSSTMLDGTGALGRFDFEITDLFLFGCPLGLVIALRKTVIPALDVFQI.RPACOOV 
ETPAAGGPQLPEVS-^-LGRFDFDVSDFFLFGSPLGLVTJU^RTVLPGLDGFQVRPACSO^ 
FCPPAASSEAPDGPSSTARLDFKVSGFFLFGSPLGIiVIJU:,RKTVMPALEAAQMRPACEQ^ 
SSDSRAT KLDFEVCDFPMFGSPLSVVi:iAARK~r.HDAKAALPRPNCHQV 

YNLFHPADPSASRLEPLIiERRFHALPPFSVPRYQRYPLGDGCSTLLADVLQTHNAAFOEH 
YSFFHCADPSASRLEPLLEPKFHLVPPVSVPRYQRFPLGDGQSLLLADALHTHSPLFLEG 
YNLFHAADPCASRLEPLLAPKFQAIAPLTVPRYQKFPLGDGSSLLLADTLQTHSSLFLEE 
YNLPHPTDPIASRLEPiLSARFSIIAPVNVPRYAKYPLGNGQPLHtLEVXQSHPQHTO^ 

GAPSSPGTAPASRGFRRASEISXASQVS6MAESYTASSIAQKAPDALSHTPSVRRLSLLA 

SSRDSPPLLDAPASPPQASRFQRPGRRMSEGSSHSES SESSDSMAPVGASR 

LEMLVPSTPTSTSG ^.-AFWKGSELATDPPAQPAAP—STTSEWK 

NNLLAGRRIiSDASMQSTISGLIENVSLSTlHA HZ 

LPAP 



SPTTPGPHPPARKASPGLERAPGLPELDrGEVAAKWWGQKRIDYALYCPDALTAFP 

ITAKWWGSKRIDYAL YCPDVLTAFP 
ILERWWGTKRIDYSLYCPEAIiTAFP 
LQNKWWGTKRLDYALYCPEGLSNFP 

TVALPHLFHASYWESTDWSFr,r>RQVMRHD-.NSSILELDGKEVSVFTPSKPRBKWQRKRT 
TVAI,PHLFHASYWESTDWAFILRQVMRYE--SVNIKESARI,DPAALSPANPRBKWLRKRT 
TVTLPHLFHASYWESADWAFILRQVIEKB--RPQIAECE--EPSXYSPAFPREKWQRKRT 
AHAI.PHX.FHASYWESPDVXAFXI.RQXGKFE6XPFVGSNDDKDNASPHPGQPREKWXKKRT 

HVKLRNVTAiraRINDAIJUSrEDGPQVLTGRFMYGPLDMVTLTG 
QVKLRNVTA1«IRANDVXAAEDGPQVI,VGRFMYGPLDMVAX,TGEKVDIL 
QVKXRNVTSNHRASDTWCEGRPQVI,SGRFMYGPLDVVTr.TGEKVDVYXMTQPLSGKWIH 
SVKLKNVAANHRANDVIVQEGREQRLNARFMYGPLDMITLHGEKVDVHXMKDPPAGEWTF 

LDTiVTNNSGRVSYTXPESHRLGVGVYPXKMWRGDHTFADSYXTVLPKGTEFWFSXDG 
LDTEITNSSGRITYNVPRPRRLGVGVYPVKMVVRGDQTCAMSYLTVLPRGMECWFSIDG 
FGTEVTNSSGRX.TFPVPPERALGXGVYPVRMVVRGDHTYAECCLTWARGTEAWFSIDG 
IiSTEVTDKNGRXSYSXPDQVSLGYGXYPVKMWRGDHTSVDCYMAWPPLTECWFSXDG 

SFAASVSXMGSDPKVRAGAVDWRHWQDLGYLIIYVTGRPDMQKQRWAWLAQHNFPHGV 
SFAASVSIMGSDPKVRPGAVDWRHWQDLGYMILYXTGRPDMQKQRWSWLSOHNFPOGM 
SFTASVSIMGSDPKVRAGAVDWRHWQDS6YLXVYVTGRPDMQKHRWAWLSQHNFPHGV 
SFTASMSVTGRDPKVRAGAVDVCRHWQBLGYLLIYXTGRPDMQQQRWSWLSQHNFPHGL 

VSFCDGLVHDPLRHKANFLKLLXSELHLRVHAAYGSTKDVAVYSAISLSPMOXYXVGRPT 
XFFSDGI,VHDPLRQKAIFLRNLMQECFXKISAAYGSTKDXSVYSVI.GLPASQXFXVGRPT 
VSFCDGLTHDPLRQKAMFLQSLVQEVGLNXVAGYGSPKDVAVYTALGLSPSQTYXVGRAV 
XSFADGLSTDPLGHKTAYLNNLVQNHGXSITAAYGSSKDXSVYTNVGMRTDQIFIVGKVG 

KKLQQQCQFXTDGYAAHLAQLKYSHRARPARNTATRMAI^RKGSFGLPGQGDFLRS; 



RNHLL 



sn 



NIRl_Hs KKYQTQCQFi:,SE6yAAHIAVLEASHRSRPKKNNS-RMILRKGSFGLHAQPEFLRKRNHr.R 

PITPNM_HS RKLQAQCQFLSDGYVAHLGQLEAGSHSHASSGPP-RAAI.GKSSYGVAAPVDFLRK0SOLL 

CGI 1 1 1 1-PB KKLQSNATVLSDGYAAHLAGLQAVGGSRPAKGNA-RMVIPRGCFNLPGQTANPRRR--— 

NIR3_Hs RTISAQ PSGPSHRHERTQSQADGEQRGQRSMSVAAGCWGRAMTGRLEPGAAAGPK 

NIRl_as RTMSVQQPDPPAANPKPERAQSQPESDKDHERPLPALS— WARGPP-KFESVP 

ll'^^^'^T^^ ^ SRGPSQAEREGPGTPPTTLARG-KARSISLKLDSEE - 

CGlllll-PB RLHEQATNEN 



r.'®"Z^^.*' "triglyceride content of a Drosophila CG3811 (GadFly Accession 
Number) mutant 
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